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Abstract. Bilinear estimates for the wave equation in Minkowski space are 
normally proven using the Fourier transform and Plancherel's theorem. How- 
ever, such methods are difficult to carry over to non-flat situations (such as 
wave equations with rough metrics, or connections with non-zero curvature). 
In this note we describe an alternative physical space approach which relies on 
vector fields, energy estimates as well as tube localization, splitting into coarse 
and fine scales, and induction on scales (in the spirit of Wolff [EIJ], pol). 



1. Introduction 



In this paper n > 2 is an integer, and all implicit constants may depend on n. 

We shall call a free wave any (possibly vector-valued) solutiorj^ (p to the free wave 
equation 

in R"^^. We write 4>[t\ :— {4>{t), 4>t{t)) for the data of (p at t. This space is equipped 
with the energy inner product 

:= i / '^(t)[x,t)-y'^{x,t)+<Pt{x,t)^t{x,t) dx, 
2 JR" 

and we write E{4)[t\) := 0[i])e- If <t>: i' are free waves, then these quantities are 
time-independent and we shall simply write them as (0, V')e and £'(</>) respectively. 

If is a free wave and A is a dyadic number (i.e. an integer power of two), we say 
that (j) has frequency A if the spatial Fourier transform 0(^, t) is supported in the 
regiorn |^| ~ A; note that this property is independent^ of t. 



1991 Mathematics Subject Classification. 35J10. 

^To avoid technicalities we shall assume that our waves are Schwartz functions in space. 

^Here and in the sequel, we use C to denote various constants depending only on n and a 
parameter e which will appear later in the paper, and we use A < B to denote the estimate 
A < CB. Finally, we use A ~ B to denote the estimate A < B < A. 

^ The time independence property is invalid in a curved spacetime. There are however tech- 
niques, based on paradifTerential calculus, which allow one to circumvent this difficulty, see 
and 
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Let A be any dyadic number. We use the symbol Px to denote any Littlewood-Paley 
projection operators of the form 

where rj is any bump function adapted to the annulus |^| 1; the exact choice of 
T] may vary from hue to hue. Note that if is a free wave, then Pa0 is a free wave 
with frequency A. Also, if </> has frequency A, then we have the reproducing formula 

V^^ = A''-«PaVS<^ (1) 

for any a, 6 > and some suitable P\ (possibly tensor- valued) . Here V!J refers to 
fractional spatial derivatives. By computing the convolution kernel of P\ wc thus 
obtain the pointwise estimates 

|V^</>(xo)| < A^-'' J mixo + y/A)|(l + \y\y dy (2) 

for any M > 0, with the implicit constant depending on a, 6, M. (Similar estimates 
obtain for spacetime derivatives Wx,t by using the equation 0(f) = 0). In particular 
we have the Sobolev inequality 

||<^||oo<A"/2-ii?(0)i/2 (3) 

whenever (/) is a free wave of frequency A. In practice, when we wish to estimate 
a bilinear expression Q{(j),ip) in L^, we shall put the lower frequency wave in L°° 
(using (H) or more sophisticated variants), and the higher frequency wave in 
(using energy estimates, possibly involving vector fields). 

In the study of non- linear wave equations it has been realized that bilinear estimates 
on free waves play a fundamental role. Two sample estimates are given below; for 
a more thorough discussion of such estimates see , ||] , |Q , jisj , . 

Theorem 1.1. Let A, fi be dyadic numbers, and let e > 0. Then we have the 
estimate 

II'3(?^':V')IIl2(Q^) ^^^min(A,Ai)'^+^£^(?!')^/^£^(V')^/^ when i? min(A, /i) > 1 
i?i/2jnin(A,Ai)^-B((/))i/2^(V')^/^ when i?min(A, /i) < 1 (4) 

whenever is a spacetime cube of some side-length R > 0, (j) and ip are free waves 
with frequency A and fj, respectively, and Q is one of the null forms 

Qo(0» : = - V0 • VV', 

Theorem 1.2. Let ^, A be dyadic numbers, and let (j), ip be free waves of frequency 
A. Then we have 

||P^(0^)||^,/.^./.(R-.+i) < (^)""t-'^A"-2-i-^£;(0)i/2i5(^)i/2 

whenever {q, r) are an admissible pair of Strichartz exponents in the sense that 

1 (n- l)/2 (n- l)/2 , , , , 
q r 2 
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Apart from the e loss in Theorem all the exponents are sharp ^ . The e can 
be removed; we will discuss this issue later in the paper. 

These two theorems apply only to frequency-localized free waves, but one can use 
Littlewood-Paley decomposition and orthogonality in a standard manner to extend 
these theorems to arbitrary free waves. More precisely, we have 

Corollary 1.3 ("First generation" null form estimate, see [0). IfQ is a null form 
and (j), ip are free waves, then 

for any e > 0, where (D) is the Fourier multiplier with symbol (1 + I^^P)"'^/^. 

Corollary 1.4 (Bilinear improvement to Strichartz' inequality). Let <f>, ip he free 
waves, and let {q, r) be an admissible pair of Strichartz exponents. Then we have 

whenever a < n — ^ — ^ , and \D\ is the Fourier multiplier with symbol |^|. 



The derivation of these co roll aries from the Theorems is given in the Appendix. 
We remark that Corollary L3 was proven in (with the shar p en dpoint e = 0) 
by methods heavily reliant on the Fourier transform. Corollary 1.4 was similarly 
proven using Fourier transform methods in jl^ , see also . 



The purpose of this note is to give different proofs of Theorems 1.1, 1.2 and their 
corollaries which rely on physical space methods instead of the Fourier transform. 
As such, we believe these arguments will be useful for such situations as quasilinear 
wave equations with rough data, in which the Fourier transform is difficult to use 
directly. The techniques used here extend to many other types of bilinear estimates, 
but we shall restrict ourselves to the above two types of estimates for simplicity. 



We briefly outline the ideas used in our arguments: 



• Splitting physical space into coarse scales and fine scales. If a large 
cube Qji is partitioned into disjoint^ sub-cubes Q^, we can split 

The inner LP(Q^) norm represents the fine scales, while the outer F norm 
represents the coarse scales. The strategy is then to choose r carefully and 
handle the coarse and fine scales in different ways. This should be compared 
with Fourier-based methods, which often employ a similar decomposition but 
in the frequency variable (jT^, |Q, etc.). 

• Decomposition into wave packets. In a spacetime cube Qj^, one can 
split a free wave (j) into almost orthogonal "wave packets" (f)T, each of which 
are also free waves and are essentially localized in a tube T, which is a suit- 
able neighbourhood of a light ray. The traditional decomposition into wave 



^We always ignore sets of measure zero when we say that two sets are disjoint. 



SERGIU KLAINERMAN, IGOR RODNIANSKI, AND TERENCE TAO 



packets, which has its origin in the so called second dyadic decomposition^ 
pioneered by FefFerman and Cordoba in connection to the multiplier 
problem, relies heavily on the Fourier transform. 

In our paper we suggest a different construction of wave packets based on 
a "dynamical" physical space decomposition which avoids Fourier space de- 
compositions and the related method of "stationary phase" . This is achieved 
by localizing our waves at two distinct times. One can use the vectorficld 
method, instead of stationary phase, to show that the resulting waves con- 
centrate along tubes. 

Distinguishing between transverse and parallel interactions. Once 
4> and V' are decomposed into wave packets, one can then separate into two 
cases depending on the angle between the two packets. When the angle is 
small, we say we have a parallel interaction, and we expect the cancellation in 
the null structure to play a role. In the large angle case we have a transverse 
interaction, which we expect to be well localized in physical space. In the 
original approach to the proof of bilinear estimates the two cases^ were both 
treated with the help of the spacetime Fourier transform. In some cases one 
can use Lorentz transforms to reduce the parallel case to the transverse case: 
see [|6|, m, |§ for further discussion. 

The new twist in our paper is the treatment of parallel interactions, for 
which the vectorfield method turns out to be very well suited. Together with 
the dynamic tube localization and the treatment of transversal interactions, 
based on induction on scales initiated in |26|, Q, this allows 

us to give an almost^ completely geometric treatment of the first bilinear 
estimate, see Theorem 



1.1 



Induction on scales (or bootstrap arguments). To prove an estimate 
at a large spatial scale R, assume inductively that an estimate has already 
been obtained at smaller scales r. This idea has been around for some time 
(see e.g. (|, ||), but only in the recent work of Wolff |2|, |^ was it 
realized that such techniques can give very sharp estimates. 
Improved decay estimates. The original proof of Theorem 1.2 was based 
on an improved — L°° decay estimate which takes advantage of the size of 
the Fourier support of the data in a small rectangle in Fourier space. 

Here we take a different approach, first developed by , based on a similar 
phenomenon in physical space. This hinges on the observation that "bump 
waves" , i.e. solution of the free wave equation with initial data localized at 
frequency one and decreasing rapidly away from the unit ball centered at the 
origin, are essentially supported on a set of codimcnsion one. In flat space this 
can be easily seen from the explicit representation of bump waves as Fourier 
integrals. Due to this fact one can improve the L°° decay of bump waves by 
averaging over a relatively coarse scale in physical space. Here we show that 
this improved decay property can be derived by vectorfield techniques in the 
spirit of Q. Thus, by avoiding the explicit representation of solutions, we 



These were developed further by Bourgain |^ and later authors (||^, jis), ||o), 

or more precisely their Fourier space counterparts. 
With the exception of the dyadic decomposition. 
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give an essentially geometric derivation of Theorem 1.2. This opens the way, 
we believe, for significant generalizations to quasilinear equations^. 

By frequency-localized fundamental solution we mean a free wave of fre- 
quency 1 whose initial data is rapidly decreasing away from the origin. It is 
well known that the L°° decay of such a solution in time can be combined 
with energy estimates to yield Strichartz estimates. However, there are other 
properties of the solution which are also useful. For instance, the fundamental 
solution is essentially supported on a set of codimension 1, which means that 
one can improve the L°° decay of the (absolute value of) the fundamental 
solution by averaging over a relatively coarse scale in physical space. This 
fact (first used in ) can be used to control the low- frequency portion of a 
bilinear or nonlinear expression. We remark that vector fields methods are 
well suited for obtaining this type of control on the fundamental solution, 
even in rough metrics (see e.g. 



In the proof of Theorem LI we shall use the first four ideas, while in the proof 
of Theorem |l.2| we shall use the first and fifth ideas. To realize these ideas we 
shall use energy estimates and commutation with special vectorfields in the spirit 
of see also Q. It is possible to use other methods (e.g. Fourier integral 
parametrices) as a substitute for vector field methods, but we believe the vector 
field approach is better adapted for more non-linear situations (e.g. quasilinear 
wave equations) in which the Fourier transform is less amenable. This paper is 



organized as follows. We prove Theorem Fl in Sections || to 0. In Section ^ we 



set up the basic machinery of tubes or wave packets, and in Section |3| we combine 
this with an induction on scales argument to reduce to the parallel interaction case. 
After some vector fields preliminaries in Section ^ we handle the parallel interaction 
case in Section |[ Finally, we give two proofs of the key tube localization lemma 
(one a hybrid of spatial and Fourier localization methods, and the other based 
entirely on spatial localization) in Sections || and 0. After some remarks in Section 



we then prove Theorem 1.2 in Section 



The first author is supported by NSF grant DMS-0070696. The second author is 
supported by NSF grant DMS-0 107791. The third author is a Clay Prize Fellow and 
is supported by a grant from the Packard Foundation. This work was conducted at 
UCLA, Princeton, Oberwolfach, and ETH Zurich; the first and third authors are 
especially grateful for Michael Struwe's hospitality during their stay at ETH. 



2. Tube localization 



A fundamental tool in our proof of Theorem 1.1 is the ability to decompose a free 
wave into "wave packets" which are concentrated in tubes along null rays. To make 
this precise we need some notation. We shall be working in the standard spacetime 
cube 

q2^= {(a;,i) : < < i? for aU i = 1,. .. ,n; R<t<2R} (6) 



* The recent result of jlj is based on a generalization of the biUnear estimates in Theorem |l.2| 
to wave equations with rough coefficients. The construction is based on Fourier Integral operators. 
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for some R > 1. Note that this cube is some distance away from the initial spa- 
tial hypersurface i = 0; this will be very convenient when we apply vector field 
techniques. 

We shall also need a small constant < e ^ 1; all our implicit constants may 
depend on e. 

Definition 2.1. Let 1 r < R he scales, let uj E 5""^ be a direction, and let 
xq £ R" be a point in space. We call the set 

T ^T{R,r,uj,xo) '-^ {{x,t) : \x - tuj ~ xo\ <r;0<t <3R} 

the tube with length R, width r, velocity w, and initial position Xg. 

We now define the notion of a wave packet localized to a tube. We shall normalize 
our definition in the situation when the original waves (j), tp are normalized to have 
energy 1. 

Definition 2.2. Let T be a tube and (j>T be a free wave with frequency /i > 1 and 
energy E(<j)T) ^ 1. We say that 4>t is a wave packet localized to T if one has the 
estimates 

\^.,tM^.t)\ < (i? + dist((x,i),T))-i"%"/2 (7) 
whenever R/2 < t < 5R/2 and {x, t) ^ T, as well as the variant 

( / I V,,t0Tp dx)^'^ <{R + dist((0, t), r))-i"0" (8) 

J x:\x\<R,(x,t)l^T 

for aU R/2<t< 5R/2. 

The estimate essentially follows from (^) by (^) , perhaps at the cost of replacing 
T by a tube of twice the radius. (The error term coming from inside the tube can 
be controlled by (|3|)). In practice we shall use (Q) for low- frequency waves and (||) 
for high-frequency waves. 

The main tube decomposition lemma can now be stated as follows. 

Lemma 2.3 (Tube decomposition lemma). Letcj) be a free wave with energy E{(j)) = 
1 and frequency fJ, ^ I, and let R > 1 and ^ ^ Then there exists a 

collection T of tubes of length R and width r, and to each tube T inT there exists a 
wave packet (pT localized to T with frequency fi, such that we have the decomposition 

(j>= ^ + (f>error (9) 

TeT 

where the error term is a free wave which satisfies E[(j)f.rror) ^ Also, we 

have the Bessel inequality 

E ^(^t) < (10) 
TeT 

and the almost orthogonality relationship 

TeT' TeT' 



BILINEAR ESTIMATES 



7 



for any subset T' of T. Finally, to control error terms we require the technical 
property that given any cube Q^, the number of tubes in T which intersect is 
0(i?iO"). 



At a heuristic level one should imagine that iperror = 0, and that the (j)T are an 
orthogonal decomposition of 0. 

This type of lemma has been used by many authors. For the closely related re- 
striction problem for the Fourier transform, these wave packets first appear in the 
work of Fefferman, Knapp, and Cordoba, and were developed substantially further 
by Bourgain [|, §, Wolff |§, and others (e.g. ||, In these papers the 

tube decomposition is achieved by a decomposition in both physical and frequency 
space. We shall give two proofs of this lemma in Sections ||, 0. The first proof is 
similar to the previous arguments but only uses the Fourier transform at the level of 
the initial data rather than in spacetime, and uses commutation with vector fields 
to control the localization. The second proof is based purely on physical space 
methods, and achieves localization to a tube by localizing the wave to a disk at two 
different times (see somewhat related "slices" approach to the Kakeya problem in 
e.g. §). 

In the high frequency case /i ^ 1 the condition r > can be relaxed to 

r > but we are unable to exploit this for our bilinear estimates 

because our argument needs <j) and "0 to be decomposed into tubes of equal width. 



Assume Lemma 2.3 for the moment. In the next section we see how this lemma can 
be combined with an induction on scales argument to reduce the bilinear estimate 
(^ to the "parallel interaction case" . In the section after that we show how com- 
mutation with vector fields can be used to treat these parallel interactions, thus 
completing the proof of (0) . 



3. Induction on scales 



We now begin the proof of Theorem 1.1. The idea is to use Lemma 2.3 and an 
induction on scales (or bootstrap) argument to reduce (|^) to the parallel interaction 
case. 

We first give some preliminary reductions. Fix A and pi] without loss of generality 
we may assume that X < ii. By rescaling we may take A = 1. 

The case i? < 1 is easy to dispose of by Holder and the Sobolev inequality (||): 

(12) 



Thus we may take i? ^ 1. 
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We now set up the bootstrap argument. For any a > 0, let B{a) denote the 
statement that the bihnear estimate 

IIQ('^'^)IIl^(Q,) ^^"^W'^'^W^' (^") 

holds for all free waves 0, ijj, all i? ^ 1 with frequency 1 and fi respectively, and 
all spacetime cubes Q^; the implicit constant can depend on a and the implicit 
constants used to define such notions as "^l> has frequency /i" . Our objective is to 
prove B{a) holds for all a > 0. 

From (^2|) we have the estimate B{^). The claim will now follow from bootstrapping 
the following Proposition. 

Proposition 3.1. If B{a) holds for some a > 0, then B{{1 — c)a + Ce) holds for 
all £ > Q, where c > is an absolute constant depending only on n. 



Indeed, from Proposition 3.1 it follows that the infimum of all a > for which 
B{a) holds must be zero. 

It remains to prove the Proposition. Fix a > 0, e > 0, and assume that B{a) 
already holds. Fix free waves (j), ip and a scale R ^ 1, and a cube Q^. We may 
assume that we have the normalization E{(j)) = E{ip) = 1. We shall take advantage 
of spacetime translation invariance to set = Q^'', where Q^'' was defined in 
(H); note in particular that the cube is a distance R away from the initial spatial 
hypersurface t — Q. Our objective is to prove that 

llQ(</',V')IL.(Q(^o,^ <i?(i-^)"+^^ (13) 

If one applies the hypothesis B{a) directly to this cube Q^"* one only gets a bound 
of i?". To improve upon this we shall only apply B{a) at smaller scales than i?, 
and use transversality to sum efficiently (in the spirit of ||2^ ) . 

We let <C r <C i? be an intermediate dyadic scale between ^/R and R to be 

chosen later; r will eventually be fairly close to y/R. We now use Lemma 2.3 to 
write 

(j)= (/)T + (j>error; "0 = X] i'T' + ''l^error 

TsT T'eT' 

where T, T' are collections of tubes with length R and width r, and for each T e T, 
is a wave packet localized to T obeying (^, ([Tl|). Similarly for the Vt'- 

The (j)erron ''Perror EH'e Small in energy and their contribution can be iterated away 
by the usual argument]^. We can therefore split the remaining portions of the null 
form -0) into a sum over interactions between wave packets: 

^ ^ Q(0T>T')- (14) 

TeTT'eT' 



^Specifically, one assumes that B(a) holds with an explicit constant A, possibly depending on 
R, and use the small energy of rf>error and tperror to bound the contribution of these terms by at 
most A/10. At the end one can absorb the A/10 into the left hand side. 
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By (0) and Holder one may easily dispose of those tubes T for which dist(r, Q^') ^ 
R. Similarly for T'. Thus we shaU assume in the sequel that dist(r, Q^^), dist(T', Q^^) 
R for aU r e T and T e T'. In particular the cardinality of T and T' is only 
O(i?i0"). 

We shall need an intermediate separation scale r <C po ^ -R to be chosen later; 
eventually po will be close to r. We will assume that po is a dyadic number. 

Informally, the strategy will be as follows. We wish to divide the above sum into 
parallel interactions \xq(T) — a::o(T')| < po and transverse interactions \xo(T) — 
xo{T')\ > Po, where xo{T) denotes the initial position of T. The parallel inter- 
actions should be small thanks to the cancellation in the null form, and we will 
use vector field methods to exploit this. For the transverse interactions, we do not 
exploit the null structure, but instead we use transversality to ensure that TnT' is 
localized to a cube of length ^ R, so that when we apply the hypothesis B{a) we 
shall gain in the a exponent. Of course we still have to sum over all tubes T, T', 
but it turns out that we have enough orthogonality in (|l^), (|ll| ) to achieve this. 

We now turn to the details. For technical reasons we have to take a little more care 
in dividing the wave packet interactions into parallel and transverse interactions in 
order for the transverse interactions to decouple nicely (cf. ||2^). Specifically, we 
shall use a bilinear Whitney decomposition of the initial data. 

The initial positions xq of the tubes in T, T' can all be contained in a spatial cube 
K of dyadic side-length CR. Let K denote the set of all dyadic sub-cubes k of K 
whose side-length 1{k) is at least po/2. Without loss of generality we may assume 
that the initial positions xq of tubes in T, T' never lie on the boundary Ok of any 
cube K e K. 

If K, k' € K are two such sub-cubes of the same side-length, we say that k and k' 
are close if n is not adjacent to k', but the parent of k is adjacent to the parent of 
k'. We write k ^ k' to denote the statement that k and k' are close. Note that if 
K ^ k', then dist(K, k') is comparable to the side-length of n or k' . 

For each k S K, define the sets 

T(k) := {T e T : a;o(r) e k} and T'(k) := {T' e T' : xa{T') e k}; 

these represent the tubes emanating from the cube n. A simple stopping time 
argument reveals that for any T E T and T' E T', exactly one of the following 
statements is true: 

• (Transverse interaction) There exists a close pair of cubes k ~ k' such that 
T e T(k) and T' e T'(k'). 

• (Parallel interaction) There exists cubes k, k' of side-length 1{k) = po which 
are either equal or adjacent such that T G T(k) and T' e T'(k'). (We shall 
write K « k' in this case. 
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We can therefore split the nuU form ( |14| ) into the transverse interaction component 

Qi^-^^>^') (15) 

and the parallel interaction component 

QiK.i^.') (16) 

where 

(j)^ := Y and Vk Y ' 

tgT(k) T'eT'(K) 
InformaUy, 0^ represents the portion of cj) emanating from k, and similarly for ip,^. 



Consider first the contribution of the transverse component (|15|). Observe that 
there are only 0(log R) possible values for the side-length 1{k). Since we are allowing 
ourselves to lose an W in the final estimate (jl^) , it suffices by the triangle inequality 
to show that 

II E Q(0«,^.')IU.(Q(^")) <i?"-'^+''^ 

for some cr > and for &\\ Pq < p < 1. 

Fix p. We use the triangle inequality to pull the sum in k, k' outside the norm. 
Observe that for each k there are only 0(1) cubes k,' which are close to k, and vice 
versa. By Cauchy-Schwarz and Bessel's inequality ( p^ it thus suffices to show that 

l|Q(</'ft,^K')IL.(Q(^o,^<i?"--+^^( E{<j^T)Y'\ Y E{i^T-)Y'^+R-^°- 

TgT(k) T'6T'(«:') (17) 

for all close pairs of cubes k, k' of side-length p. 

Fix K, k'. Observe that if two tubes T, T' in the above expression intersect, the 
angle between T and T' is comparable^ to p/R- Since T,T' have width r and 
P ^ Po ^ we thus see by elementary geometry that TdT' is contained in a cube 
of side-length 0{Rr/p) ^ R. To exploit this we shall subdivide the cube into 
sub-cubes Qi^^/p of side- length Rr/p. We can then split left-hand side of ( [l7| ) as 



1/2 

Q 



( E ii^?(^-^«')iii.(Q^^^^)) 

(Cf. ®). 



Consider the contribution of a single cube Q^^/p- Intuitively we only expect the 
portion of 4>, which propagates along the backward light cone from Qjj^/p to 
contribute to this portion. To make this intuition precise, we will now throw away 
the tubes T, T' which do not intersect Qij^/p- 



^"This crucial fact, that separation in space implies separation in angle, relies on the choice of 
Q^"* to be away from the initial spatial hypersurface t = Q. 
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Consider first the contribution of a single tube T e T(k) wliicli does not intersect 
Q,jir/p- By Holder, Bessel (p^, and the decay of 0t outside of T, we have 

||Q(0T,'0k')IIl2(Q^^^^) ^ ( — ) l|Vx.t0T||L-J|V^,f-0K.||iooi2 



< 

P 



Summing this over all tubes T and all cubes Q^^/p that this contribution is 

acceptable. Thus we may replace by X]TeT(K) TnQ 5<i0 Similarly for ■(/'k' 
(but we continue to keep the low frequency in L!^ and the high frequency in L'^). 

We are now ready to apply the hypothesis B{a). Modulo acceptable errors, we 
thus have 

Rr 

T6T(«:):TnQ„,,/^5^0 T'eT'(K'):T'nQ^^/^5^0 

Applying (|ll|) we obtain 

^ reT(«:):TnQ^^/^#0T'eT'(K'):T'nQ^^/^5^0 

The expression inside the square root may be thought of as a measure of how much 
joint energy of and ^ can interact within the cube Q^r/p- The point is that 
different cubes Q^r/p '^^^1 require disjoint portions of the joint energy because of 
transversality, and this will allow us to sum. Indeed, if we insert the above bound 
into the left-hand side of (p^), we obtain a total bound of 

(-)"^''^(E E E E{^r)Ei^PT'))'/'. 

^ Q«./p TeT(«):TnQ„^/^#0 T'eT'(,.'):T'nQ„^/^#0 

By transversality of k and k' and elementary geometry, we see that for each T S 
T(k), T' G T'(k) there are only 0(1) cubes Q^^/p which the above summation 
is non- vacuous. Thus we can bound the previous by 

{Rr/prn^^ E E E{c^T)E{^T')r^'. 
TeT(K)T'eT'(K') 

Since p> po, we see that this is acceptable if we set 

po R'r (18) 

for some small absolute constant c > to be chosen later. 



It remains to control the parallel interactions (^^. By Bessel's inequality (|10|) and 
the same Cauchy-Schwarz trick used to reduce the transverse case to (p^, we may 
reduce this portion of the estimate to 

iiQ('^-V'K')ii^,(Q(o,^<i?(i-^)"i?^^( E Eiq^T)y/^(. E Ei^T')y/^+R-'°'' 

tgT(k) T'eT'(K') (19) 
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for all cubes k sa k' of side-length 1{k) = 1{k') = po- 
Fix K, k' and expand 

IIQ(0«,V'«')II^2(Q(")) = II Qi<PT,tpT')\\^2f^Q(0)y 

TGT(K),T'eT'(K') 

By arguments similar to those in the transverse case, we may use the decay estimate 
(0) to insert a characteristic function xt in front of the interaction Q{(j)T, ipx') since 
the error is acceptable. Similarly we may use (|^) to insert a xt' in front of tpT' ■ 
Thus we can restrict to those pairs T, T' of tubes which intersect. But in the 
parallel interaction case we see from the definition (|l^) of pa that each T only 
intersects 0{R'-^'^) tubes T' . Also, the tubes T only overlap by a multiplicity of 
0{R^'^). Thus we may estimate the left-hand side of (p^ ) by 

R'^'iW E Q(0T,^T')lP^.(Q(o,//'. 

TeT(K),T'eT'(K'):TnT'#0 
We use the triangle inequality to pull the summations outside the norm. From 



another Cauchy- Schwartz and application of (11), (|10|), we see that it suffices to 
show that 

for all T e T(k) and T' e T'(k') such that T n T' 7^ 0. Indeed we shall show the 
stronger estimate 

||Q(0T, V'T')IL.(Q(^o,^ < R-f'E{cl,T)'^^E{i;T')'^^ + R-^°- (20) 

for some absolute constant /? depending only on n, if c was sufficiently small and 
r was sufficiently close to VR. This shall be deferred to Section |^, after some 
preliminaries on vector fields. 



4. A DIGRESSION ON VECTOR FIELDS 



In this section we introduce our main tool for obtaining decay and localization 
estimates, namely the Killing and conformal Killing vector fields (see 13 , |lj] for 



a more comprehensive discussion). We begin with an informal discussion. 



We are interested in those vector fields X — X^da which preserve the space of free 
waves. In addition to the constant co-efficient vector fields (9i, . . . , 9„, dt), one also 
has the Poincare group of vector fields 



S 

u 



= Xidt 

— Xidj 



rdr 

\- tdi ioi 1 < i < n 

- Xjdi. for 1 < I < j < n 



(Scahng) 
(Lorentz boosts) 
(Angular rotations) 



We use 

r = r^"'*^-* := (5, Li, . . . , L„, VLi^2i ■ ■ ■ , ^n~l,n) 

to denote the entire collection of Poincare vector fields, thus F is a vector-valued 
first order differentiation operator. We can then define F'' for any fc > 1 as a 
fc-vector-valued fc*'' order differentiation operator. 
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The Poincare vector fields F = F^"^") are centered at the origin, but we may also 
translate them in spacetime to be centered at an arbitrary point {xq, to), creating a 
new set of conformal Killing vector fields F^^^''^^ This will turn out to be important 
for tube localization. 

If (/) is a free wave and Z3 is a collection of conformal Killing vector fields (e.g. 
D — F), then D'^c/i is also a free wave, and in particular we have the energy identity 

\\^x,tD''cl){t)h = WW^.tD'^mh- 

To exploit this we seek to use vector fields D which are "large" at time t and 
"small" at time zero. In practice, this means that we try to ensure 4>(0) is localized 
in physical space (in order to make the weights associated with the Xj small at time 
zero), and at time t we interest ourselves with the regions where the vector fields 
D are large and span the tangent space. In such regions we will obtain extremely 
good decay estimates^. We observe the convenient fact that if 4> has frequency /i 
and D is any collection of constant co-efficient or Poincare vector fields, then D''(j) 
also has frequency fi. 

To localize a wave to a cone centered at the origin (0, 0) it is enough to use the 
Poincare vector fields F*^°'°^ (in fact one can use only the scaling vector field S to 
do this). Similarly for cones centered around other points {xo,to) in spacetime. To 
localize instead to a tube, say one with velocity ei, one shall need some additional 
(constant coefficient) vector fields, notably 82, ■ ■ ■ ,dn and dt + di. 

We now turn to the details. Let ■0 be a free wave with some frequency fi. Then 
r'^V is also a free wave with frequency fi, and the energy identity yields 

||V,,tFV(t)l|2 < ||V,,tF'=^(0)||2 

for all A: > (here and in the sequel all implicit constants may depend on k). 
On the other hand, if we expand F*"' out using the Leibnitz product rule, and use 
the equation dip = to convert any repeated time derivative to a repeated space 
derivative, we obtain the pointwise estimate 

k 

|V,,tF'=^(a:, 0)1 < Ix^^i^.M^, 0)|. 

Combining these bounds with (^) we obtain the weighted energy estimate 

||V.,tF'=^(i)||2 < \\il + fM\x\fV.,Mx,0)\\Li. (21) 



^^By taking k arbitrarily large, we will in fact obtain arbitrarily fast decay; we will be able to 
justify taking these many derivatives because we will always be working in the frequency-localized 
setting. To some extent this is still true in such apparently rough situations as quasilinear wave 
equations, because of the frequency localization. In this paper we shall differentiate many times 
(e.g. lOOn times) but one can get away with far less differentiation by being more careful with 
the error terms, etc. 
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As a first attempt to exploit this differentiability in F we observe the identities 

t " a; ■ 

j=i 

where we use polar co-ordinates x = rw As a caricature, we thus have something 
like 

If we combine this with repeated applications of the Leibnitz product rule, we 
conclude the pointwise estimate 



\^iM^,t)\< \ ^ ^|r^v(x,t)|. (22) 

This is estimate is good away from the light cone, but is not completely satisfactory 
in the region \x\/2 < \t\ < 2\x\. In this case we introduce the null frame 

:= sgn{t)dt + dr good null direction 
-E- := sgn(i)9t — dr bad null direction 
Ai :^ di — ojidr angular derivatives. 

We write y :— {Ai, . . . , An) for the angular derivatives, and p :— y) for the 
good derivatives. 

These vector fields are not quite conformal Killing, but we can write them in terms 
of the Poincare vector fields: 



it + r)E+ = S + J2^^L^ 
i=l 
n 

{t-r)E. =S-Y,^^L, 



i=l 

r 



tAi — Li ^ ^ LOiLj. 



As a caricature, we thus have 



|t| ' - \\t\-\x\\ ■ 

By repeated applications of the Leibnitz rule, we can thus improve ( p^ ) to 

\P^viMt,x)\ < —-^ ^ (23) 

|iH|t|-|x|pir'o 

near the light cone for all j,k > 0. Informally, this means that if the F vector fields 
are under control, then good derivatives gain a \t\ while bad derivatives only gain 
a\\t\-\x\\. 
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We now specialize the above estimates. Fix R> 1, ^> 1, and suppose that ^ is a 
wave of frequency fi such that E{^) < 1. We also assume that tp is localized around 
the origin at time zero. More precisely, we assume there is a radius 1 < A ^ R 
such that we have the estimate 

(/ |xp«0"|V,,*V(a:,0)|2 dx)i/2<l. (24) 
From we thus obtain 

for all < fc < lOOn. Since T^^ip has frequency ji, we can use the reproducing 
formula @) to then obtain 

\W''i^it)h ^ A'=/"'S(i^)'/' + A-i"""+'^/"'. (25) 



We now specialize to the case when ijj has frequency /J. = 1, and time is restricted 
to the region R < t < 2R. To emphasize the frequency 1 we shall write (f> instead 
of ip. 

First suppose that we are in the region r < \t\/2 or r > 2\t\; in other words, we are 
away from the cone. By ([22[), (03) we see that 



t0(t, x)\ < A'^R-'^Ei^y^^ + A-i"0". (26) 



Now suppose that we are in the region \t\/2 < r < 2\t\, so that r ^ R. From (|23|) 
we have 

\r'^lMt,x)\ < E |r'"v^,,(A(t,x)|. (27) 

We now use the angular Sobolev inequality 

||u||ioo(s„-i) < + ||y""^u||ii(s„-i), 

which is easily verified by n — 1 applications of the fundamental theorem of Calculus. 
Applying this inequality to u :— | j0(i, x)p, we obtain 

\yiMt,x)\'<[ ^ J2 \ryiMt,ru;')\\ryitHt,ru;')\dv'. 

^ i+j<n-l 

By (H^) we thus have 

This is a square of the norm on a sphere of radius r. We then use (||) to average 
out this sphere into what is essentially an annulus of thickness 0(1) around the 
sphere of radius r. This implies that 

0<i<n~l 

Applying ( p5| ) we thus obtain the pointwise estimate 

|V^,,</)(t,x)|<(A)^£;(0)l/2+A-9On. 
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If we combine this with (|2^) we obtain the decay estimate 

l|VL</.(t)||oo<(^)^i?(0)^/' + A-9°" (28) 

for k = lOOn (say). But since ip has frequency 1, this estimate in fact holds for aU 
k by the reproducing formula (0). Finally, we remark that we may replace (f> by Tcf) 
on both sides without affecting the estimate. In particular we have 

H H ^29) 



5. Parallel interactions 



Fix T, T'. To complete the proof of Theorem 1.1 begin in Section ||, we need 
to prove (po|). (We also need to prove Lemma 2_^, but this will be done in later 
sections) . 

In earlier work such as |^ , |^ , the proof of ( pO| ) was accomplished by using 
Lorentz transforms to reduce to the R = I case. However we do not wish to do this 
here as this method seems to rely heavily on the special properties of Minkowski 
space. Instead we shall use commutation with vector fields. 



In proving (|20|) we will actually prove a somewhat stronger result: 

Proposition 5.1. Let ^/R <ti A ^ R, and (f>, ip be free waves of frequency 1 and 
fi respectively, where /i <^ 1 . Suppose we have the energy normalization 

E{4>),E{^)<1 

and the localization properties 



' \x—xo\'>f^ 

and 

(/ |x-:EoP"°"|V,,t7/;(a;,io)|' dxf'^ <l 

•l\x-xo\:s>h 

for some xq G R" and Q <t{) < R/2. Then we have the fixed-time estimate 

IIQ('/'»(t)IL.(R") <(^)^"+'^/'i?('^)'/'i?W'/' + i?-'"" (30) 

for all R<t< 2R. 



Proof By translation invariance we may set a;o = 0- Also by a time translation 
and some redefinition of R if necessary we may set to = 0. 

Fix (j), ip, R, /i. In the region away from the light cone, i.e. |a;| < t/2 or \x\ > 2t, we 
can control (/) using ( p6| ) and -0 using energy estimates to easily obtain (|30|). Now 
suppose we are near the light cone, so that |a;| ^ t ^ R. 
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We shall write the null form Q in terms of the null frame Ai, E- developed in 
the previous section. More precisely, we claim the pointwise estimate 

\Q{M\<\P^\\'^^.M + \'^xMPM- 

To see this, we substitute in the identities di — Ai -\- f^i5+ — %E^ and dt = 
\Ej^ + \E-, and observe that in each of the three null forms Qo: Qii Qij the 
E^(j)E^tp term (which is the only potentially bad term) always vanishesFn. 



From the previous, (|2^) and Holder we have 

\\Q{<i>,m)h < ^{\\mt)\\oo\\v.Mt)\\2 + \\^.,Moo\\rm\\2)- 

The claim now follows from estimating (j) using decay estimates (p8|), (p9|), and ip 



using the standard energy estimate and (25) 



We now apply this Proposition with (j>, ip replaced by (j)T and VT' respectively, with 
to set to R/2, and xq set to the center of k. Recall that pa — rR^ is the size of k. 
By the Proposition and an norm in time we have 

If c is sufhciently small, and r is sufhciently close to ^fR, the claim follows. (Indeed 
from the argument we see that we just need r to be substantially less than iJ"/ ) . 



6. First proof of Lemma p.3| : Phase space preparation of the initial 

DATA. 



Fix /i, i?, r. It suffices to prove the claim under the assumption on the initial 
data that (/)t(0) — ±j\/— A(/)(0) for some sign ±; this is because general initial data 
can be split into two pieces of the above form via the formula 




0(0) ±i> 



-) 



Note that this decomposition is orthogonal with respect to the energy inner product 
(•, ■)e^ and does not affect the property of (p having frequency /i. 



We shall assume that 0t(O) — +i^/—/S.(t>{Q)\ the argument for the other sign is 
similar and is left to the reader. It shall be convenient to define the length scales 
s := R-^/^r and A := R-^'/^r. 



Since the (pT are free waves, to construct (j)T it will suffice to construct their initial 
data 0t[O]. By standard arguments we may partition 1 = g^. Xxq on R", where 
S C R" is a lattice of spacing ~ s, and Xxg is a bounded function with Fourier 
transform supported in the ball |CI M and which satisfies the estimates 

\Xxo{x)\ < R'^'p^'" whenever |a; - xo| > s (31) 

^^In principle the Qo null form should be somewhat better than Qi and Qij, as can be seen 
by inspection of the bilinear symbol (see e.g. [^), but this cannot be detected by this simple null 
frame analysis. 
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for any M > 0; the implicit constants may depend on M. For instance, one can 
set Xxo •= ^m/cXQ(3;q)j where the Q(xo) form a partition of R" into cubes of 
side-length ^ s. 

We also partition 1 = J^ujen '^^ S^~^ , where is a s/i?-separated set of direc- 
tions and rji^ is a bump function adapted to the disk of radius 0{s/R) centered at 

LU. 

We define the sector multipliers Pfj,^uj via the Fourier transform as 

where a is a bump function which equals one on a suitable annulus |CI ~ A* ^-nd is 
adapted to a dilate of this annulus. Thus P^.u: is a Fourier projection to a tubular 
region of length /i and width sfi/R, oriented in the direction lo. Observe from 
construction that s ■ fj, and s^/R ■ s are both > R'^ fj,. By the uncertainty principle 
we thus see that the convolution kernel K^j of P^^ui is integrable and obeys the 
estimates 

|Vii:„(x)| < R-^'fi-^^ whenever > s (32) 

for any M . 

Our collection of tubes T will be defined by 

T:={r(i?,r,.To,c^):a:oGE;coGr!}; 

note that the claimed cardinality bound on the intersection of T with any i?-cube 
is clear. 

Next, our (pT will be defined via their initial data as 

<pT{R,r,xo,u){^) Pti,uj{Xxo<P{^)) 

and 

at0T(O) :=i\/^0t(O). 
We will not need an error term for this argument and will set 4>error := 0. 

By construction it is clear that the 0t are free waves with frequency ~ /i, and that 
(|) holds. 

Observe that the physical space multipliers Xxq are almost orthogonal, as are the 
Fourier space multipliers P^i^uj- Thus ( pO| ) holds. Now we show (pi]). Since the time 
derivative of (pT is controlled by the spatial derivatives, it suffices to show that 

II E v.p^,.(x.o'^(o))ii2<i?^^( E \\^xP,Axx.mmf'^+R-''°'' 

{uj,xo)eX (i^,xa)<^X 

for all subsets X of x S. 
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Observe from Plancherel's theorem that the V^f'/^.o; multiphers are essentially or- 
thogonal as Lo varies. Thus it suffices to show that 

II E v.p^,.(x.o0(o))l|2;Si?^^(E \\^.P,Ax..m)\\lf''' + R-''''' 

xo&Y xo&Y 

for all e and F C S. 

Fix tj, Y . From (|3l|), (^) and the assumption E{(j)) < 1 we see that 

|V,P^,^(x.o(/.(0))(a;)| < (i? + Ix - xo|)~^V"^^ whenever |x - a;o| > A 

(33) 

for any M. Thus one can restrict xP^l,u){Xxo4>{^)){x) to a ball of radius 0(A) 
around xq by paying a total price of 0(i?^^^°"). Since these balls only overlap by 
about 0{R'^'^) or so, the claim follows. This proves (|ll|). 

It remains to show that each 0t is localized to T. Fix T; without loss of generality 
we may take xq = and u) = ei. We have to show (0), (||). We only show the 
bound (H); the bound (0) essentially follows from Sobolev (replacing r by r/2 
if necessary). We thus need to show 

(/ |V,,t0T(x,t)pdx)i/2<(i?+|:,_tei|)-ioo» (34) 

whenever |i| ~ R. 

We shall achieve this using vector fields by means of the following lemma, which 
will also be used in our second proof in the next section. 

Lemma 6.1. Let < r R, and let A = R^'^/'^r. Let 4>t be a free wave of 

some frequency /i 1 such that 

|V^^t0T(a;,O)| < (i?+ |a;|)-*V"*^ whenever \x\ > A (35) 

for any k > 1 and any M , with the implicit constant depending on k and M of 
course. Also, if E denotes the vector fields 

i::={dt+di,d2,... ,dn} 

then we assume the estimates 

^(S'^f/.T) <i?'°""(4T)'^' (36) 
it 

for all fc > 0. Then (||) holds for all \t\ - R. 

Proof Since (pT has frequency /i, it will suffice to show that 

(/ |Vj:+VT(x,t)|2da;)V2<(i^+|^_iei|)-io"V (37) 

J \x~tei I >r 

for some suitable k which we can choose at will. 

Fix t. From (|3^) and (^6|) we have the regularity estimate 

£;(r'=s>T)'/' < i?i"""(A//)^(^)-'' + i?-*V"*^ (38) 

it 

for any k,j > 0. 
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We now divide the region \x — tei\ > r into several sub-regions and prove the 
locahzation estimate ( |37| ) separately for each sub-region. 

Case 1. - |a;|| > r 

In this case we are away from the light cone and one can use the fact that the vector 
fields r span. 

By (|2^) we have 

\^'^yM^,t)\<r-''\r''v.,tM^,t)\ 

and hence by the regularity estimate ( p8| ) 

-'a::||t|-|2:||>r 

Since r — R^^'^A, we thus obtain ( ^7|) for this region by setting k sufficiently large. 
Case 2. - |a;|| < r, and xi > \t\/2. 

Without loss of generality we may take \x2\ > l^sl,... ,|x„|, thus \x2\ ^ r. In 
particular |aJ2| ^ r/R. The idea is now to use the fact that T together with 82 
spans. 

We now use the identity 

E^=E+ + —A2 - —82; 

UJ2 OJ2 

since and p form a null frame, we thus informally have 

v./'<"(i + -)(|),s). 

UJ2 

Since \uj2\ ^f/R, we thus obtain after repeated applications of the Leibnitz rule 

i+j<k 

for any k. Applying the regularity estimate ( ^8| ) and substituting A = R^^/'^r, we 
thus obtain 

J \x2\ 

The claim follows by setting k sufficiently large. 
Case 3. \\t\ - \x\\ < r, and xi < \t\/2. 

In this case we use the fact that p together with dt + di span. Specifically, we use 
the identity 

1 — Wl 1 — UJl 1 — LOl 

Since 1 1 — wi | ~ 1 by hypothesis, we thus informally have 
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One then argues as in Case 2. 



Fix t. We now apply the Lemma just proven to obtain the locahzation estimate 
(p4[). By repeating the derivation of the initial data localization estimate (33) we 
can obtain the pointwise estimates (|3^). We now verify the regularity estimate 

lei). 



,dn this is clear from the Fourier support of (f>T- Now 
- di. By repeated application of the equation Dcj)^ — 



For the derivatives 82, ■ ■ ■ 
consider the powers of dt 
we see that 



and hence 



(dt + d^fh 



On the other hand, the Fourier transform of Vj; ((/)t is supported in the region 
^ C — ^ici + 0{iis/R). In particular we have the crude estimate — 
i^i = 0{^s/R), and the claim follows. This concludes the proof of the localization 
estimate (p^), and Lemma 2.3 follows. 



7. Second proof of Lemma |2^: Spatial truncation at two distinct 

TIMES. 



In the previous argument, tube localization was achieved by localizing 0[O] in phys- 
ical space and then applying a Fourier projection. Although this method works, it 
is somewhat unsatisfactory as it (implicitly) uses the fact that the Fourier support 
of a free wave is preserved by the time evolution. This type of fact does not seem 
very stable in rough metrics, and so we would like to use a tube decomposition 
method which does not require as much usage of Fourier space. 

The idea is to characterize a tube by its intersection with two time slices t — and 
t — 3i? (cf. the strategy used to attack the Kakeya problem in Q). We shall write 
T for '3R. 

In this section it will be convenient to take s := R^/'^+^/'^ and A :— With 
this value of s, we may partition 1 = X^^oes -E^" in the previous section. 

Let U{t) denote the free evolution operator on free waves, thus U{t)(f)[0] :— (j)[t]. 
Then we may use linearity to split 

E c^(t)(x.ot^(-r)(x.>[T])). 

In other words, for each xq, Xt, we localize (j) to an s-neighbourhood of Xt at time 
T, evolve back by the free wave equation, localize to an s-neighbourhood of xo at 
time 0, and use this as initial data. As we shall see (using the fact that s > VR), 
the second localization will not significantly disturb the localization of the first. 



We divide the summation into two terms, depending on whether we have a null ray 
(in the sense that \xt — xo\ — t + 0(A)) or not. 
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First consider the contribution of the case ||xr| — |a:^o|| 7^ ^ + 0(A), so that the 
two points do not form a null ray. This case is small and will be absorbed into 
the error term 4>error- To prepare for this, we fix Xt, and define ip = ipx^ by 
tplt] := U{t — t)(xx^4'[t])- Thus -0 is a free wave of frequency /i which is essentially 
localized to the disk {(a:, t) : x — Xr + 0{s)} at time t. To exploit this we recall that 
p(^T,T) denotes the Poincare vector fields centered at the point (xt-, r) in spacetime. 
From the reproducing formula (|l|) and several applications of the Leibnitz rule we 
have 

i^((r(-^.^))fc^) < (,^)2fey'(i + h_^)-ioo«|v,^,<^(a;,r)p dx 

for any k, where of course the implicit constant depends on k. By the pointwise 
estimates (H) (adapted to F^^-'^) rather than T^^-^^)) we can thus bound 



( / I V^+V(a:, 0)p dxf'^ < A-'isf^n f (l+^^)-i"o„| ^^^^^(^^ rf,)i/2. 

J||a;|-T|>A J A 

Since ^ has frequency /i, we thus see from the pointwise bounds (^ that 

( / |V..,V(x,0)pdx)V2 < {s/An f (l+t^)-ioon|v^_,0(:,,r)|2dx)V2. 

J||x|-r|>A J 

By making k sufficiently large we may set (s/A)*^ < i?^^"*'". From this estimate it 
is easy to show that 

xo,x^e'S::\\x^\-\xo\\^T+0{A) 

which is acceptable. 

It thus remains to consider the terms when — |a:;o|| = + 0(A). In this case 
we can find a tube T = T{xq, Xr) which is a distance 0(A) from (xq, 0) and {xr, t), 
and we define 0T(i) = U {t){xxoU {~t){xx^4'['''])) accordingly. If we let T denote 
the collection of all such tubes (which we can of course make disjoint) then (||) 
holds. To show (|l0|), we observe from spatial orthogonality and the energy identity 
that 

E EiUmxxoUi-rKxxM))) 
= E E{xx„U{~T){xxMrm 

Xo,x^E'S 

< J2 EiUi~r){xxMr])) 

x^es 

= E ^ixxAM) 

x-res 

as desired. 

Now we show the localization estimate (^; the localization estimate (|^) is 
similar and follows from (^) (with r replaced by r/2) and the pointwise estimate 

(!)• 
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Without loss of generality we shall assume that xq — and Xr — rei + 0(A). In 
this case we need to prove the localization estimate (^). By Lemma |6.l| it suffices 
to verify the initial data localization estimate (pq) and the regularity estimate (p6h. 



The claim (35 ) follows easily from the decay of cutoff Xxo and crude energy estimates 
on (f>T (the Besscl inequality (p^) will do). Now we verify (p6|). Once again we set 
tp = tpx^ hy ^p[t] :— U{t ~ t)(xk^0[t]). If derivatives E land on the cutoff Xxg 
then one gains a power of s for each derivative (note that we can use the equation 
□ = to convert any repeated time derivative to a repeated space derivative). 
Thus it suffices to show that 



£;(x.oSV[o])<i^^''""(^)^ (39) 



We may replace Xxa with the characteristic function of the disk D := {(a;,0) : 
l^^ ~ 2;o| ^ A} since the rapid decay of Xxo ensures that the contribution outside 
this disk is acceptable. 

This disk is close to the light cone centered at (xr , t) . We will thus use the good 
derivatives p^^^''^\ being the vector fields p translated to be centered at {xt,t) 
and adapted to the backward time flow. 

We observe the identities 

dt + di = A^"-^"' - i(i - 4^-'^^)E^^-'^^ - i(i + c^("-"^)£;i=^-") 

where w^^^^"^) := (a; — Xr)/\x — x^] is the analogue of uj for the origin {xt-,t). On 
the disk D we have cjf^^^'^) = — ei + 0{A/R), and thus we informally have 

S" <" |)(--^) + Ay,,,. 
By many applications of the Leibnitz rule one thus obtains the rigorous estimates 
|V,,*SV(0)| < (A/i?)^-'|ViyxV(0)| 

i+j<k 

on the disk D with X ~ p^^^''^^ . To obtain the regularity estimate (|39|), it thus 
suffices to show that 

i+j<k 

Since i/j has frequency /i, it suffices by the reproducing formula (|^) to show that 

R 
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Observe that the "good" derivatives X < ir^^^^"^^. Thus 



rn<i 

The claim now follows from the Leibnitz rule, energy estimates for 0, and the 
frequency fj. hypothesis. This proves the localization estimate (||). 



Finally, we show the orthogonality estimate (]1 1|) . By the Cotlar-Knapp-Stein 
lemma (see e.g. |^) it suffices to show that 

l('/'T,0T')e| <{R+\xo~x'o\ + K - 
whenever |a;o — XqI + — x^j ^ r and T = T{xq, Xr), T' = T{x'q, x'^) are in T. 

Fix xo,x'q,Xt, x'^. Define Xt '■— xq + ^{xr — Xq) and x^ :— x'q + ^{x'^ — Xq) for each 
< i < r. Clearly we may find a time R < t < 2R such that \xt — Xf \ 3> r, which 
means that on this time slice the tubes T and T' are separated by ^ r. Since 
{ipT,(l>T')e = {(f>T[t],(t)T'[t])e, the claim follows from the localization estimate (||). 
This completes the second proof of Lemma 2.3. 



8. Additional remarks 



• The induction on scales argument in Section |3| lost a power of i?^ on a cube 
of length R. This i?^ loss can be removed in a number of ways. One possibility 
is to tighten the induction so that, if one inductively assumes a bilinear estimate 
with a constant of A (instead of something like R^), then one can deduce the 
same estimate with a constant of yl/2 + 0(1). By standard bootstrap arguments 
this gives the bound with A = 0(1). This can be done provided one performs 
extremely careful tube decompositions in which one does not lose any constant at 
all (not even a factor of 2!) in the main term of the orthogonality inequality (pi]), 
as well as the Bessel identity (|o|) . This approach was carried out in , however 
that argument is heavily dependent on Plancherel's identity (among other things) 
and will probably not be extendable to rough metrics. 

• Another option is to tighten the estimate to an estimate for some p < 
2 while still losing an R^ , in the spirit of |2^; this can be accomplished 
(for p > (n + 3)/(n + 1)) by using further geometric facts about the tubesP} 
Standard "epsilon-removal" arguments (see Q], related arguments are in 0, 
I21I ) exploiting some decay of the fundamental solution in a manner reminiscent of 

^■^In our current argument, the only geometric fact used is that any two transverse tubes only 
intersect in a localized region of spacetime. A further fact, used to good effect in | p9| (see also 
psi ) is that a tube which is transverse to an entire light cone (not just another tube) will still 
only intersect in a localized region. An alternative way to phrase this is that one can recover good 
energy estimates on light cones and similar null surfaces if one makes a transversality assumption. 
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the Tomas-Stein or Strichartz arguments, can then be used to trade in the R"^ loss 
at the expense of LP integrabihty. 

Since this argument does not appear to be well-known in the PDE literature we 
briefly sketch it as follows. Suppose for instance that we have obtained an estimate 
of the form 

for some p < 2 and all e > 0, i? ^ 1, and all free waves (j), ip oi frequency 1. We 
sketch how this estimate implies the global estimate 

\\mLi,<m'^'my^^- (40) 

For simplicity we shall reduce to the case when 0( = zLi\/—A(f>, and similarly for 
ip. In this case the energy inner product is equivalent to the usual inner product. 
We shall now cease to distinguish between (/)(t) and (j)[t] to simplify the notation. 

Firstly, we use finite speed of propagation to replace by a spacetime slab R" x /, 
where / is any interval of length R. By Sobolev embedding we may then replace 
the LP norm with an norm: 

Now we prove (^o|). It suffices to prove the weak- type estimate 
for q < p. By duality it suffices to show that 

/ mt)m\\Li dt < |f7|i/«i?(0)i/2i5(^)i/2. (42) 

This estimate is clear for \n\ < 1 just by crudely bounding (j> in and tp in L°° 
(for instance), so we may assume jfij ^ 1. 

To compare with (^2|), observe that the estimate (^l|) yields 

/ ii0(t)v;(i)iiL? rfi<if^r/n/ri?(0)'/'i?(^)'/' (43) 

Jnni 

for any interval /. This is superior if Q is concentrated in an interval / of size 
Odrij*^), but we still must consider the case when H, is widely dispersed along the 
time axis. To resolve this we use the decay of the fundamental solution as well as 
a TT* argument. 

To see this, we fix fl and let R := for some large C to be chosen later. We 
then choose e so that |ri|^/'' < We then partition the time axis R into 

disjoint intervals / of length R, and observe from (^), the hypothesis q < p, and 
Cauchy-Schwarz that we have the "weakly global" inequality 

J2 1 UiimmLi dt<\n\^/'^{Y,E{cpj)f/\Y.E{^Ji)Y'^ (44) 

, Jnni , , 
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for all free waves 0/, ■(/;/. Our next objective is to bootstrap this inequality to a 
"semi-weakly global" inequality 

J2 [ \m)i:j{t)u.dt<\n\^/''E{cpy/\j2E{ijj)y/\ (45) 

J Jnni J 
A similar bootstrap will then obtain (^). 

To obtain (^), we fix the t/i/ and write (j)[t] = [/(t)Pi0[O] (using the reproducing 
formula (]l|)) to rewrite the estimate as 

\Y,[ I u{t)p,mMmx,t) dt\ < \n\'/'^E{cb)'^'{j2E{i,j)f^^Fu^L. 

J Jnni J J 
By duality it suffices to show that 

We square this, and expand this as 

IE / / {PiU{-t)iMmt)),PiUi-s)iMs)Fis))) dsdt\ < m'/'^{j2mi)y^'\\F\\LrLir- 

J J Jnni JnnJ j (^^q^ 

On the other hand, if we start with (Bj) and apply similar considerations we obtain 



r Jnni Jnni r 



in other words, we already control the diagonal contribution I — J of (|4q). By 
Cauchy-Schwarz we also control the near-diagonal contribution when / and J are 
adjacent. It remains to consider the contribution when I and J are not adjacent. 
In this case \t — s\ > R. To exploit this we rewrite this contribution as 

J2 [ f {P^U{s-t){^Pj{t)F{t)),ijj{s)F{s)) dsdt\ 



I, J 



The decay estimate shows that PiU{s—t) maps to L°° with a norm of 0{R ^" ^''^^), 
thus we can estimate this by 



^-(„-i)/2^ r I e{^j)\\f\\l^l.e{^j)\\f\\l^ 

J J Jnni JnnJ 
which we can easily bound by 

R-(--'y'\n\\{Y^E{^pj)m\Fu^,.f, 



I 



which is acceptable if we choose R = sufficiently large. (Note that we do not 
need the optimal decay of (n — l)/2 to run this argument). This proves (|45|); the 
extension to (^) is done similarly and is omitted. This completes the derivation of 

lol). 



• In what follows we sketch a slightly different proof of Theorem 1.1, in which the 
parallel case is treated somewhat differently. We start, as before, with the tube 
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decomposition 

^« ^ 0t; tpT'- 

TeT T'eT' 

We observe that the wave packets (I)t, ipr verify the properties (Q) and (||) of 
definition 2.2, not only for R/2 < t < 5R/2 but also at t = 0. This can easily 
be checked from either of the two construction of the wave packets. For simplicity 
we shall assume that in fact, at i = 0, both (pT, i^T are localized to balls of radius 
r = i?2+'^. As before we write: 

g(^,^)«^ ^ Q(0T,^T')~Qii + Qx 

tgTt'sT' 

with the transversal interaction component 
and the parallel interaction component 

Q\\ = X! 

Recall that, 

<^K ^ 0T and V'k := X! 

TeT(fc) T'gT'(k) 
Due to the additional localization properties of the wave packets at t = we observe 
that ^fc(O), '0fe(O) are essentially localized in the cube k. We start by estimating 
Recalling, the estimate 

||g(</'fe,^fe')Wll2 < ;^(|lr0fe(O||oo||V,^tVfc' WII2 + \\V..tM\oo\\Ti^U'{t)h). 
we write 

ll<3||lll,2(QW) < 51 IIQ(</'«^,^K')lli2(QW) 

^ XT 

K^K,' ^]\.:k^k.' ,1{k)—1{k,')—pq 

R 

We can therefore chose po = rather than po « as used earlier. The 

proof of the estimate for Qx proceeds exactly as before. 



9. Proof of Theorem [l^ 

Fix (f), ip, p, A, q, r. By a scaling argument we may take A = 1. We may assume 
that ^ 1 since the left-hand side vanishes otherwise. Our task is now to prove 

< p--l-'^E{^)^/'E{^)^^\ (47) 
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If one simply discards the projection and uses Holder and Strichartz then one 
does not obtain the decay in fi in (^). To recover this decay we will use the 
averaging property of at scale l/fi (given by (|2|)), interacted against the fact 
that the fundamental solution at frequency 1 is essentially supported on a set of 
"thickness" 1. (In this we are very much in the spirit of p^). 

We divide space R" into cubes Q of side-length From (|^) we have 

\\p,mm)h:f^^R-j = (Eii^a.w)V'(^))ii;(/.(q//^ 

Q 

By (||) we have 

\\PMit)m)h:^^^Q) < m"^'-'^ E(i + A^dist(Q, Q'))-''"^4>{t)m\\LiiQ). 

Q' 

Inserting this into the previous and using Young's inequality, we obtain 

\\Pim)m)h:^^^R-^ < m"^'-'HE ii<^wvxi)iii((Q))'/'-. 

Q 

To prove (^7|), it thus suffices by Holder and Cauchy-Schwartz to show the "im- 
proved Strichartz" estimate 

IKE MtWLiiQ)y^i\L^, < f^-'Ei^y/'. (48) 

Q 

together with the identical estimate for ip. 

It suffices to prove (^8|) for 0. We remark that this estimate has essentially appeared 
in and we shall repeat the proof here. 

Observe that the claim is trivial when q = oo (thanks to the embedding F C P), 
so we will assume that g < oo. This forces r to be strictly greater than 2. 

We shall use the abstract Strichartz estimate in (Theorem 10.1). Let H be the 
Hilbert space of initial data 0[O] with frequency 1 with the energy inner product. 
We let U{t) : H ^ H he the map U{t)(f)[0] := (p[t]; note that this map is unitary 
on H. For any < < 1, let denote the Banach space given by the norm 

wmwBs :=/^~"^(Eii'^wirL|(Q))'^''' 

Q 

where rg := 2/(1 + 6) and a :— ^(^^-2) ■ Since {q,r) is admissible we observe that 
< (T < (n — l)/2. Observe that the dual space Bg is given by 

ii#]iiB;:=/'^(Eii'/'Wir4(Q))'/''^- 

Q 

Also, one has the real interpolation embedding {Bq, Bl)g_2 Q Bg (see e.g. [||). 
To prove ( ^ ) it will suffice to prove the estimate 

wummiLiB; < \\m\\H 

with := 1 - 2/r (so that r'g ^ r). 
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Observe that q ~ 2/a9. By the abstract Strichartz estimate in Theorem 10.1, 
we see that it will suffice to prove the decay estimate 

\\u{t)u{srci>[s]\\B',<\t-sr\m\\B, 

for all s,t. From the group properties of the evolution we see that U{t)U{s)* = 
U(t — s), so by time translation invariance it suffices to show that 

\\u{t)m\\B',<\tr\\m\\B,- 

We expand out the norms to write this as 

sup 1 1 0[t] I U2 < /i^i t r " ^ 1 1 [0] I . 

It thus suffices to show that 

\\m\\L■uQ)<il/^^'\t\rY.\\^l^^hl(Q') 

Q' 

for all cubes Q of side-length fi. 

We may assume that \t\ 3> /i^^ since the claim follows immediately from energy 
estimates otherwise. 

We shall assume zero initial velocity (/'t(O) = 0; the case of zero initial position is 
similar and is left to the reader. By the reproducing formula we may write 
0(0) = 0(0) * ^/'(O) for some Schwartz function ^/'(O) with frequency 1. We extend 
"0 to be a free wave by setting zero initial velocity ■0(0) = 0. By uniqueness we see 
that 0[i] = 0(0) * 0[t], thus ip[t] is a frequency-localized fundamental solution for 
the wave equation. By Young's inequality we thus see that it suffices to show the 
decay estimate 

II^WIlLi,(Q)<(l/M'|i|)^ 

for all cubes Q of side-length l//i. 

Since "0(0] is a Schwartz function, we have the standard decay estimate 

|Vl,^(x,t)|<|r^"-'^/'(l + l|i|-NI)-' 

(see e.g. one can also use the machinery of Section ^ to achieve this bound). 
Since ij) has frequency 1, we thus have in particular that 

for any k. A computation then shows that 

\\m\\LiiQ)<{ihm^''-'^'^- 

Since ct < (n — l)/2, the claim follows. This completes the proof of Theorem pT^ . 



10. Appendix: Proof of Corollaries 1.3, 1.4 



We now prove Corollary 1.2. We first begin with the preliminary remark that in 
Theorem the spacetime cube can be replaced with the slab [0, R\ x R". To 
see this we divide into cases. 
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Case 1: i?niin(A,/i) > 1. 



In this case we use finite speed of propagation and smootli cutoff functions applied 
to tfie initial data to replace the energy E{(j>) in by the more localized quantity 

J 0)p + |V<^(., 0)^(1 + dist((x^O),Q^) )_ioo» 

(One has to construct the cutoff functions so as not to destroy the property of having 
frequency A, but since RX > 1 this can be done without violating the uncertainty 
principle. See p3[ for further discussion). Similarly for E^ip). The claim then 
follows by partitioning [0, i?] x R" into cubes Q^, applying the above localized 
estimates, and then Cauchy-Schwarz. 



Case 2: i?min(A,^) < 1. 



Without loss of generality we may assume that A < /x. From the definition of energy 
and Bernstein's inequality we see that 

and 

The claim then follows from Holder. 



We now return to the proof of Corollary 1.3. Fix Q, (j), ip. We use suitably chosen 
Littlewood-Paley projections to split 

Q(0,V^)=^Q(Pa</',PmV'). 

A, /I 

We split into three cases: the high-low case A ^ /x, the high-high case A ~ /i, and 
the low-high case A ^ /i. 

In the high-low case we see that Q{Px(j), P^j^ip) has Fourier transform supported in 
the region |^| ~ A. By orthogonality we thus have 



A>At 



A fj,:ti^X 



First consider the contribution when /i 1. Applying Theorem 1.1 (with the 
previously mentioned modification) we may estimate this by 

^ (E( E ^^''/'E{p,cj^y/^EiP,4'my^'. 

A /^:^<1 

This easily sums to be bounded by E{(j)y^^E{tpY/^ , which is acceptable. 



Now consider the contribution when fi > 1. Again using Theorem 
bound this by 

^(E( E t^^"-'^^'^'E{Px^y/'EiP,yjy/Y)'/'. 

A /j:l</i<A 



1.1 



, we may 
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This can be controlled by 



A 

as desired. 



In the high-high case we use the triangle inequality followed by Theorem In 
the /i ^ 1 case we thus obtain a bound of 



J2 fi'^^'EiPx^by^'EiP^i;)'/^ < Eicpy/^Eiijy^ 



1/2 

which is acceptable. In the /i ^ 1 case we obtain a bound of 

which by Cauchy-Schwarz is bounded by _E((_D)("~^^/2+e0')i/2^^^')i/2 desired. 

The low-high case is similar to the high-low case. We only show the treatment of 
the A > 1 contribution. By orthogonality and Theorem |1 . 1| , this is bounded by 

^ E A("-l)/2+^S(P;,0)l/2^(p^^)l/2)2)l/2 < £;( (i^) ("^ l)/2+2.^)^(p^^) ) 1/2 

M A:l<A</i A* 

which is acceptable (replacing e by e/2 as necessary). This concludes the proof of 



Corollary 1.3 



We now prove Corollary 1.4. We use suitably chosen Littlewood-Paley projections 
and the triangle inequality to split 



X.fi 

and consider the high-low, high-high, and low-low cases separately. 

In the high- low case X ^ fj, the symbol |-D|~'^ can be replaced by A^"^. We then 
use Holder and Strichartz' inequality (see e.g. to estimate 

< X-" X^-^-"^ E{Px(f>Y/'^fi^'-^''^ EiPf^i^y/^ . 
Thus we may estimate this contribution by 

^A--A^-i-^ii;(FA<^)i/2(^^t-i-fi5(p^V^)i/2). 

A M<> 

The inner sum can be estimated by 

Mt-^-i5(p^^)i/2 < X^E{\D\^-'.-H)'^', 

/J<A 

and the claim then follows by performing the A summation. 
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The low-high case A <C ^ is similar, so we turn to the high-high case X ^ fj,. To 
simplify the notation we shall take A = /i, although this does not substantially 
affect the argument. By Cauchy-Schwarz it suffices to show that 

From scale invariance we may take A ~ f , so that the derivatives \D\ on the right 
hand side may be discarded. 



The function P\(f)P\%lj has frequency < 1. By the Littlewood-Paley decomposition 
and the triangle inequality it thus suffices to show that 

\\P,\D\-%Px(j>Px4')\\^./2^.r, < v'E{Px(j)f^''E{Pxi>Y'^ 
for some e> 0. But we may replace l-Dl""^ by v^"^ , and the claim then follows from 



Theorem L2 and the hypothesis a < n ^ 



4 2n 
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